NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework

Chen, Yuhang; Tan, Zhen; Jaiswal, Ajay; Qu, Huaizhi; Zhao, Xinyu; Lin, Qi; Cheng, Yu; Kwong, Andrew; Cao, Zhichao; Chen, Tianlong (November 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)

Free, publicly-accessible full text available November 4, 2026
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications

Jaiswal, Ajay; Wang, Yifan; Yin, Lu; Liu, Shiwei; Chen; Runjin; Zhao, Jiawei; Grama, Ananth; Tian, Yuandong; Wang, Zhangyang (July 2025, International Conference on Machine Learning (ICML))

Free, publicly-accessible full text available July 13, 2026
CancerGPT for few shot drug pair synergy prediction using large pretrained language models

https://doi.org/10.1038/s41746-024-01024-9

Li, Tianhao; Shetty, Sandesh; Kamath, Advaith; Jaiswal, Ajay; Jiang, Xiaoqian; Ding, Ying; Kim, Yejin (December 2024, npj Digital Medicine)

Abstract Large language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology and medicine has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structured data and sample size are limited, by extracting prior knowledge from text corpora. Here we report our proposed few-shot learning approach, which uses LLMs to predict the synergy of drug pairs in rare tissues that lack structured data and features. Our experiments, which involved seven rare tissues from different cancer types, demonstrate that the LLM-based prediction model achieves significant accuracy with very few or zero samples. Our proposed model, the CancerGPT (with ~ 124M parameters), is comparable to the larger fine-tuned GPT-3 model (with ~ 175B parameters). Our research contributes to tackling drug pair synergy prediction in rare tissues with limited data, and also advancing the use of LLMs for biological and medical inference tasks.
more » « less
Full Text Available
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Hong, Junyuan; Duan, Jinhao; Zhang, Chenhui; Li, Zhangheng; Xie, Chulin; Lieberman, Kelsey; Diffenderfer, James; Bartoldson, Brian; Jaiswal, Ajay; Xu, Kaidi; et al (July 2024, International Conference on Machine Learning (ICML 2024))

Full Text Available
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Hong, Junyuan; Duan, Jinhao; Zhang, Chenhui; Li, Zhangheng; Xie, Chulin; Lieberman, Kelsey; Diffenderfer, James; Bartoldson, Brian; Jaiswal, Ajay; Xu, Kaidi; et al (July 2024, International Conference on Machine Learning (ICML))

Full Text Available
Physics-Driven Turbulence Image Restoration with Stochastic Refinement

https://doi.org/10.1109/ICCV51070.2023.01118

Jaiswal, Ajay; Zhang, Xingguang; Chan, Stanley H; Wang, Zhangyang (October 2023, IEEE)

Full Text Available
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Hong, Junyuan; Duan, Jinhao; Zhang, Chenhui; Li, Zhangheng; Xie, Chulin; Lieberman, Kelsey; Diffenderfer, James; Bartoldson, Brian R; Jaiswal, Ajay Kumar; Xu, Kaidi; et al (July 2024, Proceedings of Machine Learning Research)

Compressing high-capability Large Language Models (LLMs) has emerged as a favored strategy for resource-efficient inferences. While state-of-the-art (SoTA) compression methods boast impressive advancements in preserving benign task performance, the potential risks of compression in terms of safety and trustworthiness have been largely neglected. This study conducts the first, thorough evaluation of three (3) leading LLMs using five (5) SoTA compression techniques across eight (8) trustworthiness dimensions. Our experiments highlight the intricate interplay between compression and trustworthiness, revealing some interesting patterns. We find that quantization is currently a more effective approach than pruning in achieving efficiency and trustworthiness simultaneously. For instance, a 4-bit quantized model retains the trustworthiness of its original counterpart, but model pruning significantly degrades trustworthiness, even at 50% sparsity. Moreover, employing quantization within a moderate bit range could unexpectedly improve certain trustworthiness dimensions such as ethics and fairness. Conversely, extreme quantization to very low bit levels (3 bits) tends to reduce trustworthiness significantly. This increased risk cannot be uncovered by looking at benign performance alone, in turn, mandating comprehensive trustworthiness evaluation in practice. These findings culminate in practical recommendations for simultaneously achieving high utility, efficiency, and trustworthiness in LLMs.
more » « less
Full Text Available
Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge

https://doi.org/10.1016/j.media.2024.103224

Holste, Gregory; Zhou, Yiliang; Wang, Song; Jaiswal, Ajay; Lin, Mingquan; Zhuge, Sherry; Yang, Yuzhe; Kim, Dongkyun; Nguyen-Mau, Trong-Hieu; Tran, Minh-Triet; et al (October 2024, Medical Image Analysis)

Full Text Available
Single Frame Atmospheric Turbulence Mitigation: A Benchmark Study and a New Physics-Inspired Transformer Model

https://doi.org/10.1007/978-3-031-19800-7_25

Mao, Zhiyuan; Jaiswal, Ajay; Wang, Zhangyang; Chan, Stanley H. (November 2022, European Conference on Computer Vision)

Full Text Available
FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude

https://doi.org/10.1109/WACV57701.2024.00611

Liu, Feng; Ashbaugh, Ryan; Chimitt, Nicholas; Hassan, Najmul; Hassani, Ali; Jaiswal, Ajay; Kim, Minchul; Mao, Zhiyuan; Perry, Christopher; Ren, Zhiyuan; et al (January 2024, IEEE)

Full Text Available

« Prev Next »

Search for: All records